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ABSTRACT 

We present a catalog of galaxy clusters selected using the maxBCG red- 
sequence method from Sloan Digital Sky Survey photometric data. This catalog 
includes 13,823 clusters with velocity dispersions greater than ^400 km s" 1 , and 
is the largest galaxy cluster catalog assembled to date. They are selected in an 
approximately volume-limited way from a 0.5 Gpc 3 region covering 7500 square 
degrees of sky between redshifts 0.1 and 0.3. Each of these clusters contains 
between 10 and 190 E/S0 ridgeline galaxies brighter than 0.4 L* within a scaled 
radius R/joo- The tight relation between ridgeline color and redshift is used to 
determine photometric redshift estimates for every cluster. In addition, SDSS 
spectroscopic redshifts are available for at least the brightest galaxy in 39% of 
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the clusters. Tests of purity and completeness are obtained by running the cluster 
finder on realistic mock catalogs. These studies suggest that the sample is more 
than 90% pure and more than 85% complete for clusters with masses > 1 x 10 14 
solar masses. Photometric redshift errors are shown by comparison to spectro- 
scopic redshifts to be small (A z ~ 0.01), essentially independent of redshift, and 
well determined throughout the redshift range. Spectroscopic measurements of 
cluster members are used to examine the extent to which projection along the 
line of sight contaminates identification of brightest cluster galaxies and cluster 
member galaxies. Spectroscopic data are also used to demonstrate the correlation 
between optical richness and velocity dispersion. Comparison to the combined 
NORAS and REFLEX X-ray selected cluster catalogs shows that X-ray lumi- 
nous clusters are found among the optically-richer maxBCG clusters. This paper 
is the first in a series which will consider the properties of these clusters, their 
galaxy populations, and their implications for cosmology. 

Subject headings: galaxies: clusters: general, catalogs 



Introduction 



Galaxy clusters are the most visible features of large scale structure. They occupy 
very massive dark matter halos, and are observationally accessible by many means. Their 
locations are revealed by the presence of large numbers of tightly clustered galaxies, pools 
of hot X-ray emitting gas, and relatively strong features in the gravitational lensing shear 
field. Many of their properties, including their number as a function of dark matter mass and 
their spatial clustering, can be predicted with confidence from N-body simulations. Since 
clusters occupy the tail of the halo mass function, their numbers are exponentially sensitive 
to variations in cosmology. Precise observations of large numbers of clusters provide an 
important tool for testing our understanding of cosmology and structure formation. Clusters 
are also interesting laboratories for the study of galaxy evolution under the influence of 
extreme environments. 



Clusters were first detected in the 18th century (jBivianol 120001 ) as significant overdensi- 
ties of galaxies on the sky. The early 20th century saw the first explorations of their physical 



prop erties, including the discovery of dark matter in Zwicky's study of Coma (IZwicky 



1933 



19371 ). Statistical studies of the cluster population became possible with the introduction 
of large area s urveys implementing uniform selecti on methods. For example, Abell and his 
collaborators (lAbell. G.O.I Il958l ; lAbell et al.l Il989l ) created a sample of 4073 clusters over 
the whole sky by visually inspecting photographic plates. They identified clusters as over- 
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densities of 30 or more galaxies (each no more than two magnitudes fainter than the third 
brightest member, m 3 ) within a fixed metric aperture on the sky. Distances were estimated 
based on the magnitude of the tenth brightest clust er member. S i milar catalogs, with some- 
what different selection criteria, were developed by lZwicky et al.l (Il96ll ). 



The Abell catalog has been very influential (see, e.g., iBahcalll Il999l . and references 
therein), supporting a wide range of cluster science. Among its most important achievements, 



i t enabled some of the fi r st studies of the large-scale distri 



( iBahcall fc Soneiral Il983l ; iPostman et al.l Il992l ; iMiller et a 



jution of matter in the Universe 
1999h . The brightest galaxies in 



clust e rs have also been used to construct Hubble diagrams (IGunn fc Okdll975l ; ISandage et al. 
19761 ; IPostman &: Lauerl Il995f). Studies of the Abell cluster c atalogs have be en hampere d 
by problems of projection (jSutherlandlll988l ; ICollins et al.lll995l ). false clusters (jLuceylll983l ). 
and an uncertain selection function. Projection refers to the unwanted effect of unassociated 
foreground and background galaxies on the measured properties of clusters (richness, HI3, 
etc.), while 'false clusters' are objects overdense in projection, but not physically bound. 
Uncertainties in the selection function arise from the use of visual selection and the relatively 
poor calibration of photographic data. 

These problems prompted the construction of new optically-selected sample s of clusters 
using digitized photographic material and automated cluster selection algorithms (ILumsden et al 
19921 ; iDalton et al.lll994t iGal et al.ll2003l ). In recent years, the increased availabilit y of large- 



area CCD photometry has prompted a new round of optica l cluster selection (e.g.. lGalll2006 
and references therein). For example. IPostman et al.l (119961 ) constructed a sample of clusters 
using a "matched filter" technique, smoothing the galaxy distribution with a filter optimized 
for the detection of distant clusters. Another example i s the direct search for clusters as re 



solve d sources of diffuse optical light (IDalcantonl Il996l ; IZaritsky et al.lll997l ; iGonzalez et al. 
200lh . 



The advent of multi-color CCD photometry obtained over large areas has had a major 
impact. Precise measurement of galaxy color has literally added a new dimension to optical 
cluster finding. Clusters are dominated by old, red E/SO ga laxies which occup y a narrow 
region in color- magnitude space known as the E/SO ridgeline (IBower et al.lll992l ). This tight 
clustering o f galaxies in color, magnitude, and space allows significant improvements in clus- 
ter fin ding JOstrander et al.lll998l : lGladders fc Y^boodBahcall et alihooi bladders fc Yee 
20051 ). The location of this ridgeline in color shifts smoothly with redshift, providing quite 
precise estimates of cluster redshift. These new "red sequence" techniques have largely elim- 
inated projection effects and false clusters from optical cluster catalogs, making it possible 
to perform accurate measurements of the large-scale structure in the universe using such 
samples. 
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In parallel with advances in optical searches for clusters, there has been consider- 
able progress in cluster identification in X-rays. Over the last de cade, a variety of clus- 



ter catalogs have been constructed from X-ray surveys of the sky ( Gioia fc Luppinol 



Ebeling et al.lll996l; iRosati et al.lll998t iRomer et al.ll2000l ; IBohringer et al 



2000 



1994 



Mullis et al. 



20031 : IBohringer et al.l I2004J ) These surveys are less sensitive to projection effects, though 
they can be contaminated by AGN emission. In addition, the relation of X-ray luminosity 
and temperature to the underlying total mass of a cluster is thought to be more accessible 
to theoretical prediction than optical mass proxies. The principle limitation of X-ray cluster 
surveys is the relative scarcity of the observing resource. Existing all-sky surveys are limited 
to detection of rare, high-flux sources. As a result, the total number of groups and clusters 
yet detected in X-rays is modest; there are only 1579 objects i n the BAX databa se, a com- 
pendium of all X-ray groups and clusters detected as of 2004 (jSadat et al.l l2004f ) . With no 
future all-sky X-ray survey planned, it is unlikely the number of X-ray detected clusters will 
increase substantially over the next decade, though importa nt contributions wil l be made by 



serendipitous surveys in t he XMM and Chandra archives (IRomer et al.l 1200 ll ; iGreen et al. 
20041 : iRomano et al.l 120041 ) . 



Clus ters have al s o been detected as pea k s in the shear field of deep weak lensing 



surveys ( 


Er 


3en et al.l 


2000; 


Clowe et a 




2001 


Dahle et al. 


2003; 


Wittman et al. 


2003. 


2005). 



Miralles et al. 2002: Wittman et al. 2001 



ing the projected surface mass density of the clusters. Unfortunately, this projection in- 



troduces substantial nois e in the mass estimation for individual clusters (IWhite et al.l 12002 



de Putter &: Whitdl2005l ) . While lensing provides a unique test for the presence of truly dark 
clusters, it is otherwise an expensive detection method. To detect a cluster using lensing, 
images must be obtained of a large number of faint galaxies in the background of a cluster. 
The same images used to detect the shear always contain very high signal-to-noise detections 
of many cluster galaxies, galaxies which would have been easily detected in much shallower 
imaging. While lensing is probably not an optimal me t hod for cluster detec t ion, is is a vi 



tal to ol in cluster mass calibration (jSheldon et al.ll200ll : I Johnston et al.ll2005l : ISheldon et al. 
20061 ) for clusters detected by any other method. 



In the future, large cluster catalogs will be constructed using observations of the Sunyaev- 
Zeldovich (SZ) effect. The X-ray emitting hot gas in clusters causes Compton up-scattering 
of a small fraction of the CMB photons which pass through it. This l eads to a characteristic 
distortion in the transmitted CMB spectrum (jCarlstrom et al.l l2002f ) . Observation of these 
distortions provides an additional method for cluster detection. The SZ effect nicely com- 
bines the guarantee of a deep potential well provided by the hot gas along with a detection 
signature which is essentially independent of redshift. Of course SZ surveys must be supple- 
mented by follow-up optical observations to determine redshifts if they are to be useful for 
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cosmology. They will also benefit from the mass calibrations which weak lensing measure- 
ments provide. As a result, combined SZ and optical surveys are a particularly promising 
approach for cluster study in the coming years. 

The most significant challenge in cluster science lies in relating what is best under- 
stood about clusters from theory to what is measured in observations. Structure formation 
simulations can predict the evolutio n of the dark matter component of clusters with great 
confidence (e.g.. lEvrard et al.ll2002l ). Observations provide measures of the cluster galaxies 
and gas with ever-increasing precision. Unfortunately, the cluster baryons we observe partic- 
ipate in hydrodynamic and thermodynamic interactions which are too complex for current 
simulations to confidently model. This leaves an uncomfortably uncertain gap between the- 
ory and observation. Closing this gap is the goal of a substantial body of current research. 
New simulati ons of many kinds are pushing the ability to predict the evolution of galaxies 
and gas (e.g JSpringel et alJliooi iKravtsov et alJbood ) while new observations are extract 



ing ever more direct and precise measurements of total cluster mass prox ies (IVikhlinin et al. 



2006; Diaferio et al. 2005 



Maiumdar & Mohr! 12004 ; iKatgert et al.ll2004f ). 



This paper describes a large new ca talog of galaxy cl usters extracted from Sloan Digital 
Sky Survey optical imaging data (SDSS: lYork et al.ll2000l ). It is the first in a series of papers 
which will explore the properties of optically-selected SDSS cluster samples and their use 
for cosmology. Galaxy clusters are selected by calculating the likelihood that each observed 
galaxy is the brightest galaxy in a cluster. This likelihood is based on the galaxy's color 
and magnitude, along with the degree to which other galaxies are clustered around it in 
color, magnitude, and space. The resulting catalog is an essentially volume-limited list of 
galaxy group and cluster locations together with estimates of their total galaxy content and 
redshift. We present here a catalog of the richest objects detected by this method, including 
13,823 clusters, each containing ten or more E/S0 ridgeline galaxies brighter than 0.4 L* (in 
the i-band) within a scaled radius R.2oo> drawn from about 7500 square degrees of sky, and 
extending over a redshift range from z=0. 1-0.3. 

Section [2] presents a brief description of the SDSS data relevant for this paper. This 
is followed in Section [3] by an outli ne of the cluster sel ection algorithm, details of which 
are presented in a companion paper (IKoester et al.ll2006l ). Overall properties of the derived 
cluster catalog are presented in Section [U Basic tests of the completeness and purity of 
cluster selection based on mock catalogs are described in Section [51 Tests of the quality 
of redshift estimates, the importance of projection effects, the relationship between cluster 
richness measures and velocity dispersion, and the space density of clusters are outlined in 
Section [61 As an additional test, comparison of this catalog to existing X-ray selected cluster 
catalogs is provided in Section [71 We conclude in Section [HI Where needed, we assume a 
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standard ACDM cosmology with Qm = 0.3, Q\ = 0.7, and Hq = 70 km s 1 Mpc 1 . 



Data 



Data for this study are drawn from Sloan Digital Sky Survey^: a combined imaging and 
spectroscopic survey of 10 4 deg 2 in the North Galactic Cap and a smaller region i n the South. 



The imaging survey was carried out using a specially designed 2.5 m telescope (IGunn et al. 
2006T) in drift-scan mode in five SPSS fil t ers (u, q, r, z, z) to a limiting magnitude of r<22.5 



(|Fukugita et all 119961 : IGunn et all 119981 : iLupton et all 1 19991 : iHogg et all 1200 ll : ISmith et al. 
20021 ) . Photometric erro rs are typically lim ited at bright magnitudes by systematic uncer- 



tainties at the <3% level (llvezic et al.ll2004f) . Astrometric errors are typically smaller than 50 
mas per coordinate (IPier et al.ll2003l ). The spectroscopic s urvey targets both a 'main' sample 



of ga laxies with r<17.8 and a median redsh ift of z~0.1 (IStrauss et al.l 120021 ; iBlanton et al. 



20031 ) and a 'luminous red galaxy' sample (jEisenstein et al.l 120011 ) which is approximately 
vol ume limited out to z=0 .38. For more details of early and mor e recent SDSS data releases 



sec 



Stoughton et all ((20021) and lAdelman-McCarthv et al.l (120061 ) 



Two catalogs are extracted from the SDSS data for use in this paper. The first is a 
photometric galaxy catalog, used a s input to the c l uster finder. This catalog is generated 
using the same criteria applied in IScranton et al.l (120021 ) to create the input catalog for 
measurements of galaxy clustering, including use of an optimized star-galaxy separator. 
The input catalog includes galaxy positions and 'CMODEL' magnitudes in each of the 
SDSS bands. These magnitudes are constructed from a weighted combination of Petrosian 
and model magnitudes, with the weights determined by the quality of individual fits of 
deVaucoleurs and exponential profiles to the surface brightness profile of ea ch galaxy. All 



magni tudes are corrected for Galactic extinction using the extinction maps of ISchlegel et al. 
( 119981 ). The resulting input catalog contains 2.3 x 10 7 galaxies. The distribution of these 
galaxies in g — r color and i magnitude is shown in Figure [TJ 

The galaxies in our imaging catalogs include a wide range of objects with various colors, 
magnitudes, and morphologies. Only a subset of these have colors and magnitudes consistent 
with E/S0 ridgeline galaxies in the target redshift range. Imposing broad color cuts to exclude 
galaxies too red, blue, or faint to be bright cluster members substantially reduces the size 
of the input catalog with no loss in cluster finding efficiency The colors and magnitudes 
appropriate for E/S0 galaxies in the redshi ft range 0.1 < z < .3 are extracted using the 
color-magnitude-redshift model described in lKoester et al.l (120061 ). We focus on this redshift 
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range for reasons explained in more detail in Section [H These model parameters are turned 
into color-magnitude cuts in the following way: at an assumed redshift, we note the g — r 
and r — i colors and 0.4 L* i-band magnitude prescribed by the model. We include in the 
search catalog all galaxies whose colors (g — r) candidate and (r — i) can didate pass the following 
criteria anywhere in the search redshift range: 



(g - r)(z) model - y/a*. r + 0.15 2 < (g - r) candidate < (g - r)(z) modd + ^ 'a 2 err + 0.15 2 (1) 
(r - i){z) model - ^a 2 rr + 0.18 2 < (r - i) candid ate < (r - i){z) mode i + a/ 'cr 2 err + 0.18 2 

^candidate ^ 0.4 L^Z) 

Here, the o -2 ^ a re the color errors of the objects, measured using the methods described in 



Scranton et al.l (120051 ). The factors of 0.15 and 0.18 each correspond to 3 times the intrinsic 
width of the E/S0 ridgeline in g — r and r — i, respectively. At the same redshift, a magnitude 
constraint selects objects whose i-band magnitudes indicate that they are 0.4 L* or brighter 
at the current redshift. Running these selection criteria for the chosen redshift range reduces 
the total number of input galaxies to 4,689,495, or about one fifth of the input galaxy 
catalog (see Figured] for the range of these cuts in g — r vs. i, excluding color errors). These 
galaxies are all treated as potential centers of groups or clusters by the maxBCG cluster 
finder described below. We note that this will bias the cluster search against completely 
blue, low redshift clusters and groups (Section 3). Most of these input galaxies (a total of 
1,389,858) are ultimately absorbed as centers or members of maxBCG groups and clusters. 

The second input catalog is drawn from the SDSS spectroscopic data, including all 
available spectroscopic targets identified as galaxies with confidently measured redshifts. 
The catalog used here contains 567,486 galaxies with typical radial velocity errors of about 
±30 km s _1 . This spectroscopic catalog is not used in the detection of clusters, but provides 
vital data for testing the fidelity of photometric redshift estimation. Spectroscopic data 
also allows us to determine the relationship between measured cluster richness and velocity 
dispersion. The g — r color as a function of redshift for this spectroscopic catalog is shown 
in Figure [2j 



3. Description of the Cluster Detection Algorithm 



Galaxy cluster detection within the SDSS has been presented in several prev i ous w orks. 
Initial efforts at pho tometric selection of clusters were presented in iGoto et al.l (120021 ) and 
Bahcall et al.l (120031 ). The latter compared the results of two different selection methods, 
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a hybrid matched filter method (iKim et all 120021 ) and an earlier version of the "maxBCG" 
method presented here (lAnnis et al.lll999l). A pho tometrically selected catalog of compact 
groups has also been assembled (ILee et al.l 120041 ) . In addition to these photometrically 
selected catalogs, s everal group and cl us ter catalogs have bee n asse mbled from spectroscopi c 
data, for example iMiller et al.l (120051 ). iBerlind et al.l (120061 ). and IWeinmann et al.l (120061 ). 
These spectroscopic catalogs are quite robust, but limited in volume by the flux-limited 
nature of the SDSS spectroscopy. Previously detected clusters have also been studied using 
SDSS optical data. Popesso and collaborators have published a se r ies of pape rs relating 



X-ray and optical properties of Abell clusters (IPopesso et al 



2004 



2005|, |2006|). Studies 



have also been condu cted of the galaxy populations ([Hansen et al.ll2005l ) and lensing masses 
( ISheldon et al.ll200ll ) of an earlier generation of ma xBCG objec t s. Lu minosity functions for 
photometrically selected clusters were considered in lGoto et al.l (120031 ). 



The maxBCG galaxy cluster selection algo rithm applied to the input galaxy catalog is 
described in more detail in a companion paper (IKoester et al.l 120061 ) . Briefly, the algorithm 
exploits two well-known features of rich galaxy clusters. First, the bright end of the cluster 
luminosity function is dominated by galaxies occupying a narrow region of color-magnitude 
space (the E/S0 ridgeline). These galaxies are sometimes referred to as red-sequence galaxies. 
Second, clusters contain a brightest cluster galaxy (BCG) that is located near the center of 
the galaxy distribution. This BCG is of ten distinctly brighter than other cluster members 
(IHansen et al.ll2005l; lLoh fc Straussll2006l ). and nearly at rest relative to the cluster center of 
mass (jOegerle fc Hillll200lf ). While these criteria are not universal for all groups and clusters, 
they become increasingly common among the optically richest clusters. 

For every galaxy in the input catalog, the algorithm measures two independent likeli- 
hoods. The first is the likelihood that a galaxy is spatially located in an overdensity of E/S0 
ridgeline galaxies with similar g-r and r-i colors, and the second is the likelihood that it has 
the color and magnitude properties of a typical BCG. Both likelihoods are evaluated for 
every input SDSS galaxy at a grid of redshifts. The redshift which maximizes the product 
of these likelihoods is then found for each galaxy. This corresponding maximum likelihood 
redshift provides a first estimate of the cluster redshift, and is used in the following steps. 

Next a list of member galaxies for each potential center is assembled. The number of 
galaxies projected within 1 Yc x Mpc of this potential center, brighter than 0.4 L*, fainter 
than the potential center, and with colors matching its E/S0 ridgeline, is counted. This 
number, which we call N ga i, provides a first estimate of cluster richness. This initial richness 
estimate is then used to estimate cluster size R.2oo> defined here as the radius within which 
the density of galaxies with —24 < M r < —16 is 200 times the mean density of such galaxies. 



This R200 is determined using the relation between ~N ga i and R200 derived in IHansen et al. 
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( 120051 ). This richness-size relation was determined for a somewhat different, earlier version 
of the maxBCG cluster finder. While the normalization of the richness-size relation may 
differ slightly in this new catalog, we expect the overall scaling to remain the same. 

Once the list of cluster center likelihoods is measured and lists of corresponding poten- 
tial members are assembled, galaxy clusters are assembled beginning with the richest and 
progressing down through a percolation procedure. The percolation begins with the highest 
likelihood potential center. This galaxy is declared a cluster BCG and assigned its maximum 
likelihood redshift. All galaxies within a projected separation R 2 oo of the BCG, within ±2cr 
of the E/SO ridgeline in the space of g — r and i magnitude, and brighter than 0.4 L* in i 
band are labeled as members of this cluster. Subsequent galaxies can be considered BCGs 
only if they are not already members of a higher likelihood center. We further eliminate 
any lower likelihood centers which fall within R 20 o and have a maximum likelihood redshift 
within ±0.02 of a higher likelihood BCG. This latter step prevents the double counting which 
might occur if a real member of rich cluster has slightly anomalous colors which prevent it 
from being selected as a cluster member. Note that this local suppression has important 
implications for measurement of the spatial clustering of clusters. The observed clustering 
will be strongly affected by the cluster finding algorithm on scales less than a few Mpc. This 
process continues until all viable centers are either declared centers or otherwise absorbed 
as cluster members. Each object detected has a center defined as the BCG location, an 
estimated redshift, and a richness given by the number of E/S0 ridgeline members brighter 
than 0.4 L* and within R 20 o of the cluster center (N^ 00 ). 

Once the basic cluster finding step is complete, we refine the measurement of cluster 
properties in several ways. First, redshift estimates are adjusted. A small empirical cor- 
rection (« 0.004) is applied to the photometric redshifts, based on the comparison of spec- 
troscopic to photometric redshifts described below. Information about the spectroscopic 
redshift of the cluster, where it is available, is appended. The luminosity of the BCG and 
the summed luminosity of the BCG and all cluster members in r and i bands is added. 
Luminosities are reported in units of 1O 1O L , k-corrected to z=0.25, the median redshift of 
the sample. K-correct ions are computed using the LRG template in v4.1.4 of KCORRECT 



(IBlanton et al.ll2003bl ). assuming the maxBCG photometric redshift. These k-corrections do 
not include an evolution correction. The final cluster catalog contains an array of measured 
properties, including location, photometric redshift, spectroscopic redshift (where available), 
and several richness and mass estimators, including N 9a ;, N^ 00 , L^ CG , Lf CG , L™ em , and 
jjnem _ o nce ^ e c l us ter catalog has been constructed, many more refined measures of cluster 
properties, including overall richness, shape, concentration and galaxy content can be made. 



In concluding this description, it is worth noting that color cuts and the matched-filter 
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employed in maxBCG restri ct detections of c ompletely blue groups and clusters, especially 
those found at low redshift (IGoto et al.ll2002l ). Furthermore, it assigns lower likelihoods to 
objects with less well-defined ridgelines, unusual BCGs, or BCGs off-center from the cluster 
galaxy distribution. Although maxBCG may not miss such objects, they are of course 
penalized for their deviation from the model. More generally, since a perfect detection 
algorithm does not exist, understanding the selection function of any algorithm requires 
comparison to clusters detected by alternative approaches. In Section 7, X-ray selected 
clusters are compared to th e maxBCG catalog, and simulations are used in Section 5 and 
also in (IKoester et al.ll2006l ) to address the robustness of the detection algorithm. 



4. Overall Properties of the Derived Catalog 

The algorithm described above produces a catalog of all groups and clusters from z=0.05- 
0.35 which contain bright red galaxies, a total of 2.18 x 10 6 objects. The resulting list spans 
a very broad mass range, from isolated ellipticals and small groups through rich clusters. We 
focus here on only the more substantial clusters: those containing at least 10 E/SO ridgeline 
galaxies brighter than 0.4 L* within a scaled aperture R2oo- We focus on a restricted redshift 
range as well. SDSS photometric data are particularly well suited for the measurement 
of clusters at moderate redshift. In the redshift range from 0.1 to 0.3 the relation between 
redshift and g — r color is particularly simple, reflecting the gradual shift of the 4000 A break 
from the blue to the red edge of the SDSS g filter (see Figure[2]). At the lowest redshifts (below 
z=0.1), the approximately constant photometric redshift uncertainties of a z ~ 0.01 imply 
substantial fractional uncertainties in distance, causing correspondingly large uncertainties 
in derived parameters. Beyond z = 0.1, these uncertainties in distance fall below 10%, and 
decrease with increasing distance. In addition, clusters present at these low redshift s can 



be more reliably selected using spectroscopic clustering algorithms (IMiller et al.ll2005l ). At 
redshifts beyond 0.3, the 4000 A break begins to cross into the r filter and for a range of 
redshift from z = 0.32 to z=0.37, there is a significant increase in the photometric redshift 
uncertainty. 

Focusing on the redshift range from 0.1 to 0.3 has another advantage. The cluster 
detection methods described here rely on identifying cluster members brighter than 0.4 L* 
at the cluster redshift. Our input galaxy catalog is essentially complete for these galaxies out 
to a redshift of 0.4, where they have i magnitudes of about 20.5. Cluster galaxies at redshifts 
less than 0.3 have photometric measurement errors in the essential g, r, and i bands which 
rarely exceed 10%. This combination allows our cluster selection across this redshift range 
to be remarkably uniform and allows us to assemble a catalog that is close to volume limited. 
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Residual uncertainty in the evolution of the cluster galaxy luminosity function from z=0.1 
to 0.3, which effects our definition of 0.4 L*, is one of the principle remaining systematic 
errors at this point. 

The catalog presented here includes 13,823 clusters with photometric redshifts 0.1 < 
z < 0.3 and richnesses of 10 <Ng 00 < 190; 2891 of which have Njf > 20. The distribution 
of these clusters in scaled richness N^ 00 is shown in Figure [3] As expected, smaller systems 
dominate the abundance function. The largest systems are often well-known Abell, Zwicky, 
or X-ray selected clusters. The N^ 00 > 10 objects contain 213,016 member galaxies, or a bit 
less than 1% of all input galaxies. Note that the input catalog is apparent magnitude limited, 
rather than volume limited. So these figures do not transparently reflect the probability that 
a galaxy will reside in a rich cluster. 

Among these photometrically selected clusters, a total of 5413 (39%) have BCGs with 
measured spectroscopic redshifts. Only a small fraction, mostly at redshifts less than 0.15, 
have redshifts for more than 5 member galaxies. The lack of multiple redshifts per cluster 
at high redshifts is primarily due to the magnitude limited nature of the SDSS spectroscopic 
survey. The photometric redshift distribution of these clusters is shown in Figure HI Com- 
parison to the expectation for a constant comoving density sample in the standard ACDM 
cosmology in this figure illustrates the approximately volume-limited nature of the catalog. 
A 'pie diagram' for the clusters located in a 2.5° thick slice along the southern celestial 
equator is shown in Figure O and the volume-limited nature of the sample is again evident. 

The full cluster catalog is available as FITS binary and ASCII tables in the online 
edition of this work. A description of the information provided for each cluster is presented 
in Table[TJ 



5. Tests of Cluster Selection in Simulated Data 



The quality of a cluster catalog is often explored in terms of its false positive rate (pu- 
rity) and its failure rate (completeness). Until recently, determining completeness and purity 
has only been possible through first order techniq ues, for example through random inser 



tion of model clusters into real background data (IDiaferio et al.l Il999l ; lAdami et al.l 12000 



Postman et al.ll2002l ; iKim et al.ll2002l ; iGoto et al.l 120021 ) . Modern mock galaxy catalogs, in 
which the galaxy distribution is designed to represent the underlying dark matter distri- 
bution from n-body simulations, embed their galaxy clusters in their full environment of 
filaments and voids, and provide realist ic estimates of both the spatial and dynamical struc- 



ture of the cluster galaxy population (IWhite et al.l 120021 ; iKochanek et al.l 120031 ; lEke et al. 
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2004 ; Miller et al.l 120051 : lYang et all 120051 : iGerke et all l2005t IWechsler et all 120061 ). These 
mock catalogs have enabled a deeper exploration of the purity and completeness of catalogs 
produced by a galaxy cluster finder. Since clusters of galaxies trace the underlying dark 
matter halo population in the universe, the mock galaxy catalogs can also help to reveal how 
well the objects found by the cluster-finder relate to dark matter halos. In the end, it is the 
distribution of these halos in the real universe that we wish to uncover, and the extent to 
which this distribution is recovered from clusters in the mock catalogs is a strong indicator 
of the catalog's quality. This recovery rate also provides useful feedback for adjustments and 
corrections that need to be made to optimize cluster finding algorithms. 

To test t he cluster finde r descr ibed in this paper, we utilize the mock galaxy catalogs 
described by IWechsler et all ( 120061 ). which were largely designed for this purpose. The 
construction of these catalogs begins with an N-body realization of the large scale structure 
in the nearby universe. Galaxies are then inserted into this simulation, adopting the locations 
and motions of dark matter particles, subject to a variety of empirical constraints. The first 
constraint simply determines the number and nature of the galaxies to be inserted. In 
this case, the number of galaxies and their distribution in r luminosit y is taken from the 
measured SDSS galaxy luminosity function (IBlanton et all 1200 ll . 12003a ). Galaxies down to 
0.4 L* in r are then assigned to particular dark matter particles according to a scheme 
which matc hes the observed lum inosity-dependent two-point clustering of SDSS galaxies as 
measured by lZehavi et all (120041 ) . The assignment does not explicitly utilize any knowledge of 
the locations of dark matter halos, but produces a halo occupation that is in good agreement 
with methods that constrain it directly. It has the advantage that it provides a way to 
naturally include the dim background population necessary for comparison to photometric 
data with less resolution than would be required for a method populating dark matter halos 
directly. 

Once a location and r-band luminosity are chosen for each galaxy, appropriate galaxy 
colors are assigned by selecting a real SDSS galaxy with the same luminosity and measured 
local density. This process insures that the simulated galaxy colors and their relation to 
their local galaxy density accurately reflects those found in the data. This is essential if the 
mock catalogs are to be useful for testing an E/S0 ridgeline based cluster finder. If the dark 
matter particle on which a galaxy is placed is within R 2 oo of the center of a dark matter 
halo, the galaxy is considered a member galaxy of that halo. 

The most serious flaw of this process is that it includes no explicit mechanism for 
creating the brightest cluster galaxies which we know often rest in the center of a dark 
matter halo. In a naive effort to fix this, the brightest galaxy in each halo is, as a final step, 
moved to the center of the halo and assigned the mean halo redshift. It may be possible to 
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solve this problem by explicitly associating galaxies with resolved d ark matter substructures 



within halos, as has been done in rece nt high-resolution simulations (INagai &: Kravtsovll2005 



Conroy. Wechsler. &: Kravtsovll2005l ). although it is difficult to do this in simulations with 



the volume necessary for photometric cluster studies. 

Straight-forward techniques are employed to quantify the purity and completeness. 
Clusters in the mock catalog are identified using the same code that is run on the data. 
To understand the false-positive rate (purity), halo membership of galaxies identified as 
red-sequence cluster members is used to determine the cluster-halo correspondence. In this 
comparison, it is important to recall that galaxies identified as cluster members by the 
cluster-finder must be red-sequence members. As such, they are only a subset of the actual 
halo members, some of which fall outside the E/SO ridgeline. This fact affects the matching 
process, guaranteeing that no perfect one-to-one match of cluster and halo members will be 
found. 

To measure purity, we look to see if each identified cluster corresponds to any real 
dark matter halo. For each cluster, the halo containing the maximum fraction of a cluster's 
members is located. If this fraction is less than some threshold, f c , there is no halo to which 
this cluster clearly corresponds, and the cluster is called a false positive. The exact choice 
of f c is arbitrary, and may be driven by the science questions that the catalog is designed 
to answer. The purity results, presented in Figure [6l are based are based on two possible 
choices: f c = 0.3 and 0.5 . At f c = 0.3, the catalog is > 90% pure for N r ™ > 10 and 
95 — 100% pure for N r g ^° > 20. Tightening f c to 0.5 reveals a degraded purity, which is 
approximately constant at 90% across the full richness range. 

To evaluate the completeness, the matching process is reversed. For each halo, we 
identify the cluster that contains the largest fraction of its members. If this fraction is less 
than some threshold, fh, the halo has been missed. Because the member galaxies in the 
mock halos are red and blue (and the cluster-finder only identifies the red ones as cluster 
members), this is a rather strict test. Figure [7| shows the results of these schemes, in which 
fh = 0.3 and 0.5, respectively. The sample is >90% complete above w 2 x 10 14 hr x M 
in the fh = 0.3 test, and is 95-100% complete for objects more massive than 3 x 10 14 hr 1 
M . Notably, both choices of fh yield nearly 100% completeness greater than 8 x 10 14 Yr x 
M Q , with the completeness degrading more rapidly in the larger fh fraction. In this more 
stringent test, 90% completeness is not reached until ~ 3 x 10 14 hr 1 M . 

Insight i nto the cluster-halo correspondence can be gained from considering the meaning 



of f c and fh (IGerke et al.ll2005l ). In measurements of purity, we seek a halo which contains 
a substantial fraction (f c > 0.3) of the cluster's members. When this fraction is small, the 
cluster-finder has merged one or more additional halos with this best match, a failure we 
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might describe as over-merging. The cluster members beyond those in the matched halo are 
actually members of other, nearby halos. If a cluster finder were tuned to maximize purity 
alone, it would endeavor to include as many members of the matching halo as possible, and 
in the process would be more likely to merge galaxies from neighboring halos. Increasing the 
value of f c required for matching ensures that a halo becomes more dominant in the cluster, 
and that the cluster is not a conglomeration of many smaller halos, until f c ~ 1. At this 
point it is possible that the cluster is only part of a much larger halo, and there may in 
fact be several clusters for which this halo is a best match. Similar flexibility in the choice 
and meaning of matching parameters affects completeness measures. In these, a matching 
cluster must contain a substantial fraction (fh > 0.3) of the halo's total member list. The 
other halo galaxies may be distributed among other clusters, in which case the cluster-finder 
has fragmented the halo into several objects. Letting this fraction approach one reduces the 
incidence of fragmentation, but is likely to cause over-merging. A more extensive d iscussion 



of these issues is presented in the companion algorithm paper (IKoester et al.ll2006l ). 

Above N^ a 2 / 00 = 10 and M = 2 x 10 14 h' 1 M the catalog presented here is ~ 90% pure 
and complete. At masses below ~ 1 x 10 14 , the decline in completeness is not surprising. 
The mean number of real halo members within a three dimensional R200 from mock catalogs 
at masses near ~ 1 x 10 14 is about 8, with a tail to higher richness. When the cluster 
finder is run on these halos, it typically increases this true member number by ~ 20% due to 
projection, pushing most 10 14 M halos over the detection threshold. Note that the precise 
values of purity and completeness are sensitive both to how the matching is done and to 
how the limits of the catalog are defined. When completeness is defined without the explicit 
threshold, and using an exclusive matching scheme between halos and clusters, the catalog 
is ~ 95% complete above M = 1 x 10 14 h' 1 M (Rozo et al, in preparation). 

Halos at lower masses include a wide variety of galaxy groups. Some include clear red- 
sequences and are easily detected. Others have larger blue fractions and are more difficult to 
identify by red-sequence techniques. There are indications from the radial pro files of these 
objec ts that low N^ 00 objects come from underdense regions of the Universe ([Hansen et al. 



20051 ) . and may be dominated by fossil groups. The mix of objects found by red sequence 
methods at low N^ 00 makes this an interesting area of future study, as do the objects missed 
by these means. 

An important reason to assemble a cluster catalog is to determine the distribution of 
dark matter halos at different masses. Ideally, there would be a one-to-one correspondence 
between halos in the real Universe and observed clusters of galaxies, with the correspondence 
encoded in a richness-mass relation such as M(N^ 00 ). Calibrating this relation would then 
allow extraction of the true halo distribution from the cluster abundance function. In fact any 
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cluster-finder inevitably fragments some halos into sets of smaller clusters, and merges some 
smaller halos into larger clusters. This complicates cluster-to-halo matching, and illustrates 
the important role which studies of realistic mock catalogs must play: without such studies, 
reliable cosmological constraints from optically-selected catalogs are likely impossible. A 
much more complete analyses of fragmentation, merging, and the richness-mass relations 
required to generate final estimates of the halo mass function will be presented in Rozo et 
al (in preparation). 



6. Tests of Derived Cluster Parameters 

In this section we describe some tests of the quality of derived cluster properties, based 
in this case primarily on reference to SDSS spectroscopic data. The large spectroscopic 
catalog provided by the SDSS (Figure [2]) enables studies of the photometric redshift quality, 
the effects of projection on the cluster finder, and of the richness- velocity dispersion relation. 



6.1. Tests of Photometric Redshifts 

Determining the physical properties of observed clusters requires accurate photometric 
redshifts. To assess the quality of z photo for a particular cluster, its true redshift z spectro 
must be known from spectroscopy. For this test of photometric redshifts, we use the clusters 
with BCG spectroscopic redshifts. Since the BCGs are bright, they are more likely to 
be targeted for spectroscopy than an average cluster galaxy. In addition, the SDSS LRG 



target selection (lEisenstein et al.ll200ll ) explicitly constructs a volume-limited sample of the 
brightest red galaxies, many of which are BCGs. The combination provides good statistics for 
the evaluation of z v h oto . Figure [8] compares all available BCG spectroscopic redshifts (zbcg) 
to cluster photometric redshifts (z p h oto ) for objects in four different N^ 00 richness ranges. 
Figure M shows the comparison of z p hoto to z spectro in a series of different redshift ranges. The 
dispersion in z spectro — z p hoto is — 0.01, essentially independent of redshift. There is a small 
bias (~ 0.004), in the sense that estimated redshifts are slightly higher than measured ones 
at all redshifts. We correct for this small bias in the photometric redshift estimates provided 
in the final catalog. The errors throughout are well represented by a Gaussian. Photometric 
redshifts for the richest half of the clusters are slightly better, with a typical dispersion in 
Zspectro — z p hoto of about 0.008. This photometric redshift accuracy allows clusters separated 
by more than about 50 hr x Mpc along the line of sight to be correctly identified. 
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6.2. Tests of Projection 

The projection of foreground and background galaxies onto the observed cluster popu- 
lation is a concern for all optical cluster selection algorithms. There are two levels at which 
projection occurs. First, there is large scale projection, in which galaxies more than a few 
tens of Mpc from a cluster and randomly projected on it are mistaken for cluster members. 
This large scale projection can easily be tested for using spectroscopic redshifts, and is an 
effect one might aspire to completely avoid. Second, there is small scale projection, in which 
galaxies near the cluster, along infalling filaments for example, are thought to lie within the 
cluster. Spectroscopic information about cluster membership on these small scales is lim- 
ited by redshift space distortions. This kind of projection is best constrained using realistic 
simulations. 

By relying on the tight clustering of E/SO galaxies in color-magnitude space, the 
maxBCG cluster finder limits the influence of projection, but it remains important to quan- 
tify this source of contamination. Direct study of projection also provides insight into the 
limitations of red-sequence cluster-finding and may suggest algorithmic improvements. 

To study the incidence of projection in this catalog, we take advantage of the large 
number of spectroscopic redshifts available for both BCGs and member galaxies identified by 
the algorithm. We begin by identifying the 3057 clusters which have spectroscopic redshifts 
for both the BCG and at least one member galaxy. The first step is to define a "best" 
redshift for each of these clusters, u Zb es t" ■ When a cluster has at least 3 member galaxy 
spectra, we take the median redshift of all the cluster member galaxies as z^ est . If there are 
fewer than 3 member redshifts, we define z^ est to be the BCG redshift. To test for projection 
we then compare individual BCG and member spectroscopic redshifts to these estimates of 
Zbest- Our determination of Zb es t is far from perfect; often it is measured from just a few 
member galaxy redshifts or from only the redshift of the BCG. Because of this, we further 
split the spectroscopic sample into the 143 clusters with at least 10 member galaxies located 
within ± 2000 km s -1 of the median redshift and the remaining 2914 clusters with fewer. 
Comparing the projection results obtained with these two samples provides a check on the 
influence of uncertainty in the determination of Zbest on our conclusions. 

We first measure the incidence of BCG projection. Since we require the BCG to be the 
brightest galaxy in a cluster, one might worry that we would occasionally select as a BCG 
a bright foreground galaxy projected onto the E/S0 ridgeline of a more distant cluster. To 
test for projection, we compare zbcg to Zb es t in the top two panels of Figure [TOl The top left 
panel shows the comparison for the 143 clusters with the best determined median member 
redshifts, while the top right panel shows the comparison for the remainder, including only 
those with between 3 and 10 member galaxies with spectra; those with Zb es t — z bcg are 
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removed altogether. The fraction of BCGs with velocities differing from the median by more 
than 2000 km s" 1 is 5.6% (8/143) for the best-measured set and 13% (117/910) for the 
remainder. While BCG projection does occasionally occur, it is not particularly common. 
When it does happen, it is most common to find an unusually red foreground galaxy projected 
onto a more distant cluster. BCG projection is also somewhat less likely as clusters increase 
in richness. 

To study the importance of projection for cluster member galaxies, we compare individ- 
ual member redshifts, z member , to the best estimate of the cluster redshift, z best . Again, we 
conduct the comparison both for the clusters with the best determined Zf, es t and also for the 
remainder, including only those with between 3 and 10 member galaxies with spectra.. These 
comparisons are shown in the lower two panels of Figure [TUl In the best-measured clusters, 
16% (427/2701) of member galaxies are found with velocities differing from the median by 
more than 2000 km s -1 . For the less well-measured clusters, 16% (829/5068) have velocities 
differing from the median by more than 2000 km s -1 . Since the determination of z be st is not 
especially robust for these clusters, this represents an upper bound on the degree of member 
projection. 

This test demonstrates the relative insensitivity of red-sequence methods to large-scale 
projection effects. No more than 13% of BCGs and 16% of the galaxies identified as cluster 
members according to color and spatial location are actually observed in projection. The 
majority of projected members lie behind the cluster; they are intrinsically bluer galaxies 
at higher redshift. Narrowing the color window for membership can reduce the inci dence of 



projection, but as shown in the accompanying algorithm paper (IKoester et al.l 120061 ) . it also 
increases the fragmentation of dark matter halos. 

The conclusions drawn here are restricted somewhat by the realities of the SDSS spec- 
troscopic selection. Galaxies for which spectra are obtained are preferentially bright cluster 
members, and there is a small fiber-collision bias against the measurement of galaxies in the 
densest regions. Since the member galaxies selected here are all brighter than 0.4 L* and 
the probability of cluster membership is highest in the densest regions, the effect of these 
limitations on our conclusions should be minimal, but compli cates measurements such as the 



radial dependence of member projection (IMcKay et al.ll2006l ) 



6.3. Relating Richness and Velocity Dispersion 



In addition to their usefulness in confirming cluster redshifts, spectroscopic samples of 
galaxies can be applied to understand the relation between richness and cluster velocity 
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dispersion. In the current catalog, the number of E/SO ridgeline galaxies, N^ 00 is a basic 
indicator of cluster richness. To understand the relationship between this cluster richness 
and velocity dispersion, we begin with a list of clusters for which a spectroscopic redshift 
for the BCG is known. Around each such BCG, we search for any other galaxies with 
measured spectroscopic redshifts, and define spectroscopic "pairs" ; each of which includes 
a BCG and a nearby tracer of the velocity field. For this study we keep all pairs in which 
the neighboring galaxy lies within a projected distance R 2 oo of the BCG and within ±7000 
km s _1 of the BCG velocity, irrespective of whether t he neighboring galax y is a member of 
the E/SO ridgeline; this i?2oo is the same derived from lHansen et all (120051 ) for photometric 
data. In many cases, a cluster will contribute only a few pairs, making the determination 
of individual velocity dispersions impractical. As a result, we gather together all pairs 
for clusters of similar richness, and hence measure the average pair-wise velocity difference 
(PVD) structure around a class of similar clusters. This stacking approach is analogo us to 
that applied in th e study of halo masses in isolated galaxies by iMcKay et all (120021 ) and 
Prada et all J^OoJ. 

Examples of the velocity structure seen around these clusters in several N^ 00 bins are 
shown in Figure [TTJ These PVD histograms are approximately Gaussian, but show clear 
evidence for components both narrower and broader than the average. This probably arises 
from the imperfect mapping between N^ 00 and mass. If all clusters in a narrow bin of 
N^ 00 had the same mass, we would expect a very nearly Gaussian PVD. If, however, the 
bin contains a mix of both lower a nd higher mass obj ects, the PVD will include a mix of 
narrower and broader components (jScoccimarrol 12004 ). To determine the typical velocity 
dispersions in each bi n, we fit two Gaussi ans plus a constant to the PVD histogram. The 



constant is shown b y IWoitak et a 



simulations (see also lMcKay et al 



(120061) to adequately account for unbound particles in 



20061 ) . The dispersion is then square root of the weighted 



average of the variances of the two Gaussians. The weights of each G aussian are determine d 
in the fitting procedure (the Expectation Maximization algorithm, see lConnolly et al.ll2000l ). 
Two Gaussians are used in the fit more as a convenient and stable parametric fit to the PVD 
histogram. They allow us to fully measure the second moment of the PVD histograms and 
more importantly, to measure their kurtosis. The kurtosis is an important parameter that 
allows one to constrain mass mixing in these PVD histograms. 

Median velocity dispersions for clusters binned by N^ 00 are shown in Figure [12] . There 
is a clear increase in measured velocity dispersion with richness N^J 00 . This increasing ve- 
locity dispersion is related to the increasing mass of the sample, reinforcing the idea that 
N^ 00 is a useful proxy for mass. This richness measure is simple; just a count of the number 
of bright E/SO ridgeline galaxies in a cluster. A variety of refined richness measures can 
be derived from this data, including matched filter likelihoods, member galaxy luminosities, 
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and total optical luminosities. It is likely that combinations of luminosity and concentra- 
tion measures will provide useful mass estimators for clusters, just as they do for individual 
elliptical galaxies (jZaritsky et al.l 120061 ) . It should also be useful to include measures of the 
diffus e optical light in the clusters, as this may include as much as 40% of the total optical 
light (IZibetti et al.ll2005l ; iGonzalez et al.ll2005l ; iKrick et al.l 120061 ) . A more detailed descrip- 
tion of these richness-dispersion scaling relations along wit h their extension to cluster mass 
and mass scatter estimates is given in a companion paper (IMcKay et al.ll2006l ). Additional 
information about the masses of clusters detected in the SDSS can be provided by we a k lens - 
ing measurements, as was done for a much smaller cluster sample in ISheldon et al.l (120011 ). 
The combination of lensing and dynamical mass calibration will provide the input needed 
for estimation of the cluster mass function. 



6.4. Measurements of Space Density 

ACDM simulations reveal that the space density of dark matter halos is nearly constant 
out to z ~ 0.3, with only a small increase in the number of objects at the high end of the mass 
function at late times. In Figure IT3| the comoving number density of clusters in a given 
bin is plotted as a function of redshift. No correction for purity or completeness is applied. 
It is approximately flat in all bins, as predicted by theory. This is a simple demonstration 
of the roughly volume-limited nature of the catalog. Measurements on simulations indicate 
that the slight decrease seen in the highest bin is due to evolution in the mass-iV^[? 

relation or a redshfit- dependent bias in the definition of 0.4L* and not to incompleteness at 
high redshift. Before proceeding to use this measurement as a tool to constrain cosmology, 
careful calibration must be made of the richness-mass relation and especially any possible 
dependence of this relation on redshift. The details of this mass calibration will be presented 
in future papers on dynamical and weak lensing mass estimators. 



7. Comparison to Known X— ray Selected Clusters 

An important test of any cluster finder is the extent to which it identifies those clusters 
found by other means. Optically-selected catalogs generated by alternative codes with mea- 
surements, sky, and redshift coverage similar to the current catalog are not yet available, but 
will in the future be invaluable in understanding selection biases in maxBCG. X-ray sur- 
veys are attractive for this purpose, in that they are dependent on different cluster physics, 
have mass proxies, accurate redshifts, and large sky coverage. Just as in comparisons of 
optically-identified clusters to mock catalog halos, some care must be taken. Optical and 
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X-ray methods may identify d ifferent cluster centers, and while op tical richness and X-ray 
luminosity are clearly coupled (ILin et al.ll2004j ; iPopesso et al.ll2005l ). there is significant scat- 
ter in this relation. Nevertheless, we expect that in the field of X-ray bright clusters there 
should be some significant overdensity of galaxies that is approximately centered on the X- 
ray peak. With this in mind, we investigate the extent to which known X-ray clusters were 
identified in this catalog. Because the maxBCG cluster-finder centers clusters according to 
the BCG location, we will compare our BCG location to the X-ray peak location. 

Comparison of optical and X-ray catalogs is complicated by several factors. First, cen- 
tral galaxies in clusters are also not always perfectly coincident with peaks in X-ray emission, 
and the identification of a single cluster BCG in merging systems is often ambiguous. The 
X-ray brightest clusters are also especially likely to contain BCGs exhibiting unus ual colors, 
often due to cooling flows with accompanying star formation or AGN activity (I Crawford 
20041 ). In addition, the optically-selected sample presented here extends to much lower mass 
objects than existing large area X-ray catalogs. For these reasons, we do not use the X-ray 
sample to quantify the purity of the optical sample, but merely as a check of our ability to 
locate most previously identified clusters. 



W e conduct our compar ison to the combined NORAS (IBohringer et al.ll2000l ) and RE- 
FLEX ( Bohringer et all2004 ) catalogs. These catalogs are purely X-ray selected, flux limited 
to about 3 x 10~ 12 ergs cm -2 s _1 at energies 0.1 — 2.4 keV, and when combined cover both the 
Northern and Southern Galactic caps. Within the redshift limits of this maxBCG catalog 
(0.1 < z < 0.3) and its sky coverage, the combined NORAS and REFLEX catalogs contain 
99 X-ray clusters. It is important to note that the NORAS catalog was based on early 
selection of extended sources and as a result is not as complete or pure a s the later REFLEX 
catalog. A new generation of the NORAS catalog is under construction (IBohringer H.II2006I ). 



To determine if an X-ray cluster was "found" in our maxBCG catalog, a cylinder cen- 
tered on the X-ray peak with a 2 Yr 1 Mpc radius and a depth \z xray — z p hoto\ < 0.05 is 
searched for optical clusters. For 94 of the 99 X-ray clusters a single, quite rich object and 
occasionally a few lower richness objects are found within these boundaries. The best match 
is defined by combining the richness of the optical object and the proximity to the center of 
the X-ray cluster. Despite the fact that a relatively deep redshift box is used for this match, 
the matches identified in this way have redshifts which agree with those given in NORAS 
and REFLEX with a dispersion of a z ~ 0.003, so they are clearly physically related. Because 
this exercise is automated, it differs from that in some previous comparisons of X-ray and 
optical properties. Here we compare optical centers identified with an automatic algorithm 
with X-ray centers, rather than visually examining each cluster and manually identifying 
BCGs. It shows that in many cases, the center selected by the automatic algorithm agrees 
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with the BCG which would be manually selected. For 76 of these 94 matches, the offsets 
between X-ray and BCG centers are less than 250 h^ 1 kpc. For these objects, the distri- 
bution of offsets has a median of 58 hr x kpc and a standard deviation of 57 hr 1 kpc. An 
additional 15 matches have offsets between 250 hr x kpc and 1 hr 1 Mpc. The remaining 
three matches have very large separations, between one and two Mpc. The separations larger 
than 250 hr 1 kpc occur for a variety of reasons, and each of these cases, along with the five 
not matched, is described individually below. 

The X-ray matching is summarized in Figure [TU This upper left panel in this figure 
shows the close correspondence between X-ray redshift and maxBCG photometric redshift. 
The upper right panel compares the overall richness distribution of the maxBCG catalog to 
the richness distribution of the X-ray matched clusters. While it is clear they are drawn 
preferentially from high richness objects, the effect is diluted by the flux-limited nature of the 
X-ray catalogs. The lower left panel compares the projected separation between X-ray and 
BCG centers to the cluster redshift. The tightly clustered group of matches with separations 
< 250 h~ x kpc is clear, along with the smaller number of large separation matches. The lower 
right panel compares the optical richness N^ 00 to the X-ray luminosity for these clusters, 
drawing a distinction between the close matches and those which are more distant. The 
correlation is poor, and will be undertaken in a future study. Point source contamination 
and the chosen physical scale are among the complicating factors. 

A case by case comparison was undertaken for each match separated by 250-2000 
h^ 1 kpc. The full details of this comparison are reserved for a future work which exam- 
ines the relationship between optical and X-ray properties of clusters. The 23 clusters for 
which the maxBCG cluster location differs from the NORAS/REFLEX X-ray center by 250- 
2000 h~ 1 kpc fall into several categories: complications in centering the X-ray flux due to 
point source contamination and mergers, maxBCG centering complications due to the lack 
of a dominant BCG, BCGs with evidence of cooling flows which produce rather blue colors, 
and matched maxBCG clusters with richnesses below the N T g ^ threshold. In these cases, 
the X-ray cluster is not missed; the centers chosen by optical and X-ray means disagree. 

A final concern in this matching exercise is the incidence of matches by chance. The 
large radius matches (250-2000 hr 1 kpc) of maxBCG clusters to X-ray clusters may be the 
most subject to chance projections. To investigate this, we place 13,823 random points in the 
SDSS footprint with redshifts drawn from the maxBCG redshift distribution and positions 
non-coincident with maxBCG clusters within a projected 2/i -1 Mpc. The same matching 
routine is run to match these points to the NORAS/REFLEX X-ray catalog. Of the 99 
NORAS/REFLEX X-ray clusters, we find matches between 250/r 1 and lOOO/rtpc, and 
6 between lOOO/i" 1 and 2000/?r 1 kpc. In the same ranges for the maxBCG clusters, we find 
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18 between 250h 1 and lOOO/i l kpc and 5 between lOOO/i 1 and 2000/i 1 kpc. This simple 
test indicates that outside l/i _1 Mpc, our matches are more subject to chance projections 
with the NORAS/REFLEX X-ray sample. 

This simple matching exercise shows that in the catalog presented here, we are ~ 80% 
successful at automatically identifying the BCGs selected manually by visual selection in 
bright X-ray clusters. This comparison also highlights some of the difficulties encountered 
in opti cal X-ray comparison s. Many X-ray bright clusters have emission line BCGs (perhaps 
25%; (jCrawford et al.lll9991 )) at their centers. Cluster-finders that search for red-sequence 
BCGs may miss these centers. Future cluster finding algorithms based on the red sequence 
may choose to account for this by searching for BCGs with unusually blue colors, although 
doing so invites contamination by foreground galaxies. In addition, there are examples in 
the X-ray catalogs of clusters without a single unambiguous BCG, and of clusters where the 
visually apparent BCG does not coincide well with the peak of X-ray emission. In some 
cases this may be due to contamination of the X-ray signal by cluster galaxy AGN emission. 
Unrelaxed clusters or those undergoing mergers are especially prone to this. The cases in 
which the optical cluster finder did not automatically identify the visually selected BCGs 
come from a mix of these issues. They are a combination of the algorithm used for cluster 
finding and the complex cluster physics that determine the BCG location, spectrum, the 
optical galaxy membership, and the X-ray signal. 



8. Discussion 



We have presented a new catalog of clusters of galaxies selected from SDSS photometric 
data using the maxBCG technique, which represents the largest galaxy cluster catalog as- 
sembed to date. This technique utilizes the clustering of galaxies on the sky, in magnitude, 
and color to identify groups and clusters of bright E/SO ridgeline galaxies. It identifies ob- 
jects ranging in size from single isolated ellipticals to the richest galaxy clusters. We have 
extracted from this very large list a sample of 13,823 clusters containing at least 10 E/SO 
ridgeline galaxies brighter than 0.4 L*. Running this cluster finder on realistic mock galaxy 
catalogs allows us to show that this cluster sample is more than 90% pure and more than 90% 
complete for halos with masses > 2 x 10 14 /i _1 Mq. Comparison between maxBCG photo- 
metric redshifts and SDSS spectroscopic redshifts for BCGs demonstrates the precision and 
accuracy of the derived cluster redshifts. The large volume of SDSS spectroscopic data also 
allows, in a somewhat limited way, an examinination of the effect of projection on both our 
identification of brightest cluster galaxies and on our determination of cluster membership. 
In both cases we show that large scale projection plays only a small role in red-sequence 
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cluster detection. 

We have shown that the basic richness measure presented here, the scaled number of 
bright E/SO ridgeline galaxies N^ 00 is strongly correlated with cluster velocity dispersion. 
Comparison of this optically-selected catalog with the existing NORAS and REFLEX X-ray 
selected cluster catalogs reveals that nearly all of these relatively X-ray luminous objects 
are detected among the richest of the optical clusters found here. As comparable wide-angle 
imaging cluster catalogs with accurate redshifts become available, these comparisons will be 
extended to catalogs selected by alternate optical algorithms as well. 

Further aspects of this cluster catalog, including galaxy populations and profiles, mass 
calibration by both dynamical and lensing measurements, diffuse light measurements, and 
the cosmological constraints it can place will be examined in future papers. 
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Fig. 1. — The distribution of galaxies in g — r color and i magnitude in the input galaxy 
catalog is displayed in this figure. Overlaid on the figure are the cuts used to extract a 
sample of potential E/SO ridgeline galaxies in the search redshift region. The gray scale is 
linear in the number of galaxies at each color and magnitude. 
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Fig. 2. — This figure shows the distribution of galaxies in the input spectroscopic catalog 
in g — r color and redshift. The close correspondence between redshift and g — r color for 
the E/S0 galaxies is apparent in the strong sequence along the red edge of the distribution. 
This forms the basis for the excellent photometric redshifts obtained for maxBCG clusters. 
The gray scale is linear in the number of galaxies at each color and redshift. 
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Fig. 3. — This histogram displays the differential number counts for clusters as a function of 
richness for the full maxBCG group and cluster catalog (the solid line). The cluster catalog 
presented here includes all objects in the shaded region. 
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Fig. 4. — This figure shows the number of clusters as a function of redshift for the maxBCG 
cluster catalog. The solid line shows the expectation for a volume-limited sample with a 
density of 2.3 x 1CT 5 clusters h 3 Mpc -3 in a standard ACDM cosmology. 
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Fig. 5.— SDSS data in the region -1.25° < Dec < 1.25°and with RA < 100° or RA > 300°, 
the SDSS southern equatorial stripe, is displayed in this 'pie diagram'. The left hand panel 
shows the locations in RA and photometric redshift of maxBCG cluster centers. The right 
hand panel shows RA and spectroscopic redshift of SDSS luminous red galaxies (LRGs). 
This slice contains 492 clusters, about 3.5% of the total catalog. Circles drawn at z=0.1 and 
0.3 show the boundaries of the maxBCG catalog redshift range. The approximately volume- 
limited nature of the cluster catalog is apparent. There are several features which look like 
'fingers of god'. These are partly generated by the ±0.01 uncertainties in the photometric 
redshifts. Their appearance is exaggerated when they enhance real features in the galaxy 
distribution, as in the two examples outlined by the ellipses added to both figures. 
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Fig. 6. — Basic results of purity tests based on mock catalog studies. In each plot the solid 
line is for the full mock catalog, the dotted lines for halos of 0.1 < z < 0.2 and dashed lines 
for halos at 0.2 < z < 0.3. The top panel is a purity plot for cluster matching fractions 
f c = 0.3, the bottom for f c = 0.5. In each case a cluster is called 'real' if a fraction of at 
least f c of its E/S0 ridgeline members is contained within R 2 oo of a single dark matter halo. 
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Fig. 7. — Basic results of completeness tests based on mock catalogs. In each plot the solid 
line is for the full mock catalog, the dotted lines for halos of 0.1 < z < 0.2 and dashed lines 
for halos at 0.2 < z < 0.3. The top panel is a completeness plot for a halo matching fraction 
f h = 0.3, the bottom for f h = 0.5. In each case, a dark matter halo is considered found if a 
fraction fh of its red sequence members is found in a single identified cluster. 
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Fig. 8. — The comparison of BCG spectroscopic redshift, zbcg-, to cluster photometric 
redshift, z p hoto, for 5413 objects, divided into several different richness N^J 00 ranges: 10 < 
N S < 20 (upper left), 20 < Njffi < 30 (upper right), 30 < Njffi < 40 (lower left), 40 < 
Ngais (lower right). Note the one object in the upper right panel which has a spectroscopic 
redshift ^0.35 and a photometric redshift ^0.21. This is the BCG of a cluster actually at 
z=0.21. The spectrum of the central galaxy (SDSS J075137.2+325447.4) shows a typical red 
galaxy spectrum with z=0.21 overlaid with strong emission lines from a (possibly lensed) 
background object at z=0.355. In this case, the photometric redshift in the cluster catalog 
is correct. 
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Fig. 9. — The difference between measured BCG spectroscopic redshift and cluster estimated 
photometric redshift for a total of 5413 clusters with N^ 00 > 10 for which spectroscopic BCG 
redshifts exist. Each panel shows the difference between spectroscopic and photometric 
redshift for a small bin of spectroscopic redshift. Gaussian fits to each distribution are 
overlaid. The fit dispersions range from a z =0.00Q to cr 2 =0.011. The small (A 2 ps 0.004) 
average bias seen here is subtracted in the final catalog. 
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Fig. 10. — Residual plots of BCG (zbcg) an d member (z m em&er) projection in maxBCG 
clusters. The top left panel compares BCG redshifts to median member redshifts, zi> es t, 
in the 143 cases where at least 10 member galaxies are found within ± 2000 km s _1 of the 
median. Only 8/143 (5.6%) of these BCGs are found with velocities differing from the median 
by > 2000 km s _1 . The upper right panel is the same comparison for the remaining 2914 
clusters, for which the apparent BCG projection fraction is 117/910 (13%) (excluding those 
where Zb est = zbcg)- The lower panels show similar comparisons for all member galaxies with 
spectroscopic redshifts. The lower left compares z mem b er to z best for the 143 well-measured 
clusters, while the lower right does the same for the remaining clusters. A fraction 427/2701 
(16%) of members are seen to be projected in the well-measured cases, and 829/5068 (16%) 
in the remainder. 
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Fig. 11. — This figure shows examples of the velocity distribution seen relative the the BCGs 
in clusters with various N r ™. While the distribution is approximately Gaussian, it is much 
better fit by a combination of two Gaussians. The characteristic dispersions shown in the 
legends are the appropriately-weighted averages of the two best fit Gaussians. 
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Fig. 12. — This figure shows the dependence of stacked velocity dispersion on scaled 
richness A^° s °. The steady increase of stacked velocity dispersion with richness illus- 
trates the connection between A^° s ° and mass. The best fit power law for this data is 
In cr = (5.52 ± 0.04) + (0.31 ± 0.01) lnTV^oo. At the Njffi > 10 threshold for this cluster 
catalog, the typical velocity dispersion is w 500 km s l . 
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Fig. 13. — Comoving number density of objects of varying richness. The solid top line 
is the space density for the full catalog. The lines below represent the space density in 
ranges of richness. From top to bottom these ranges are: 10 <A^° S °< 20, 20 <A^ S °< 43, 
43 <N r l^< 91, 91 <AT^o°< 189. Poission error bars are overplotted. 
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Fig. 14. — This figure summarizes the results of comparisons of the maxBCG catalog to 
the combined NORAS and REFLEX catalogs. The upper left shows the difference between 
NORAS/REFLEX spectroscopic cluster redshifts and the matching maxBCG photometric 
redshifts. The dispersion of the best fit Gaussian (shown overlaid) is cr z =0.004. The upper 
right shows the richness distribution of the full maxBCG catalog compared to the richness 
distribution for the clusters which match NORAS/REFLEX sources. The lower left shows 
the projected separations between X-ray and optical centers as a function of redshift. The 
lower right compares X-ray luminosity (in units of 10 43 from NORAS/REFLEX to the A^° s ° 
optical richness. Plus symbols are the 73 clusters with X-ray optical offsets < 250 h~ l kpc, 
diamonds are the 14 with offsets from 250 to 1000 hr l kpc. 
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Table 1. Information available in the online cluster catalog 



Column Name 


Data Type 


Unit 


Description 


RA 


Float 


Deg 


BCG Right Ascension (J2000) 


DEC 


Float 


Deg 


BCG Declination (J2000) 


z 


Float 




Photometric redshift 


BCGspecz 


Float 




Spectroscopic BCG redshift 


L* CG 


Float 


10 10 L 


BCG r-band luminosity 


L BCG 


Float 


10 10 L 


BCG i-band luminosity 


jjnem 


Float 


10 10 L 


Total r-band luminosity 


jjnem 


Float 


10 10 L 


Total i-band luminosity 




Int 




Detection richness 


ivri?200 
gal 


Int 




Scaled richness 



Note. — This table describes the information provided for each cluster 
in the online catalog 



